Goto

Collaborating Authors

 model 3


Shift is Good: Mismatched Data Mixing Improves Test Performance

Medvedev, Marko, Lyu, Kaifeng, Li, Zhiyuan, Srebro, Nathan

arXiv.org Machine Learning

We consider training and testing on mixture distributions with different training and test proportions. We show that in many settings, and in some sense generically, distribution shift can be beneficial, and test performance can improve due to mismatched training proportions, even if the components are unrelated and with no transfer between components. In a variety of scenarios, we identify the optimal training proportions and the extent to which such distribution shift can be beneficial. We show how the same analysis applies also to a compositional setting with differing distribution of component "skills'' at training and test.


SHA-256 Infused Embedding-Driven Generative Modeling of High-Energy Molecules in Low-Data Regimes

Verma, Siddharth, Alankar, Alankar

arXiv.org Artificial Intelligence

High-energy materials (HEMs) are critical for propulsion and defense domains, yet their discovery remains constrained by experimental data and restricted access to testing facilities. This work presents a novel approach toward high-energy molecules by combining Long Short-Term Memory (LSTM) networks for molecular generation and Attentive Graph Neural Networks (GNN) for property predictions. We propose a transformative embedding space construction strategy that integrates fixed SHA-256 embeddings with partially trainable representations. Unlike conventional regularization techniques, this changes the representational basis itself, reshaping the molecular input space before learning begins. Without recourse to pretraining, the generator achieves 67.5% validity and 37.5% novelty. The generated library exhibits a mean Tanimoto coefficient of 0.214 relative to training set signifying the ability of framework to generate a diverse chemical space. We identified 37 new super explosives higher than 9 km/s predicted detonation velocity.


The Tesla Model Y and Model 3 Standard Are Cheaper--but Still Not Cheap

WIRED

The electric vehicle tax credit is gone, and Tesla's new, more affordable models don't quite close the gap. For nearly two decades, CEO Elon Musk has promised Tesla would make a more affordable electric vehicle, to, as he put it in 2006, "help expedite the move from a mine-and-burn hydrocarbon economy towards a solar electric economy." On Tuesday, Tesla announced a new Model Y and Model 3 Standard, versions of its popular compact SUV and sedan stripped of a few higher-end touches and features to bring the price down to $39,990 and $36,990, respectively. They're both about $5,000 cheaper than the Premium variants, which goes a ways--but not all the way--toward recouping the $7,500 tax credit canceled by the GOP-led Congress this past summer . The price point also puts Tesla's newest models firmly in the "more affordable" EV camp.


Tesla unveils new lower-cost Model Y amid rising competition

Al Jazeera

Tesla unveiled more affordable versions of its best-selling Model Y SUV and its Model 3 sedan at $39,990 and $36,990, respectively, as the electric vehicle (EV) manufacturer seeks to reverse falling sales and waning market share amid rising competition. The EV maker announced its new models on Tuesday. Late last year, Musk said the vehicle would be priced below the "key threshold" of $30,000, including US EV tax credits. In the United States, prices effectively rose by $7,500 at the end of last month, when the EV tax credit ended. That helped goose quarterly sales to a record, but expectations are that they will slow down for the rest of the year, unless the affordable car comes to the rescue.


Socio-Economic Model of AI Agents

Qian, Yuxinyue, Liu, Jun

arXiv.org Artificial Intelligence

Modern socio-economic systems are undergoing deep integration with artificial intelligence technologies. This paper constructs a heterogeneous agent-based modeling framework that incorporates both human workers and autonomous AI agents, to study the impact of AI collaboration under resource constraints on aggregate social output. We build five progressively extended models: Model 1 serves as the baseline of pure human collaboration; Model 2 introduces AI as collaborators; Model 3 incorporates network effects among agents; Model 4 treats agents as independent producers; and Model 5 integrates both network effects and independent agent production. Through theoretical derivation and simulation analysis, we find that the introduction of AI agents can significantly increase aggregate social output. When considering network effects among agents, this increase exhibits nonlinear growth far exceeding the simple sum of individual contributions. Under the same resource inputs, treating agents as independent producers provides higher long-term growth potential; introducing network effects further demonstrates strong characteristics of increasing returns to scale.


A Hybrid Surrogate for Electric Vehicle Parameter Estimation and Power Consumption via Physics-Informed Neural Operators

Lim, Hansol, Choi, Jongseong Brad, Lee, Jee Won, Jeoung, Haeseong, Han, Minkyu

arXiv.org Artificial Intelligence

We present a hybrid surrogate model for electric vehicle parameter estimation and power consumption. We combine our novel architecture Spectral Parameter Operator built on a Fourier Neural Operator backbone for global context and a differentiable physics module in the forward pass. From speed and acceleration alone, it outputs time-varying motor and regenerative braking efficiencies, as well as aerodynamic drag, rolling resistance, effective mass, and auxiliary power. These parameters drive a physics-embedded estimate of battery power, eliminating any separate physics-residual loss. The modular design lets representations converge to physically meaningful parameters that reflect the current state and condition of the vehicle. We evaluate on real-world logs from a Tesla Model 3, Tesla Model S, and the Kia EV9. The surrogate achieves a mean absolute error of 0.2kW (about 1% of average traction power at highway speeds) for Tesla vehicles and about 0.8kW on the Kia EV9. The framework is interpretable, and it generalizes well to unseen conditions, and sampling rates, making it practical for path optimization, eco-routing, on-board diagnostics, and prognostics health management.


Tesla vs Britain's most confusing junction: Self-driving car takes on Swindon's Magic Roundabout - so, can you guess who wins?

Daily Mail - Science & tech

It has been dubbed'Britain's most confusing junction', thanks to its complex system of mini–roundabouts. But while many drivers struggle to navigate their way around Swindon's Magic Roundabout, the junction proved to be light work for a self–driving car. To put its Full Self Driving (FSD) mode to the test, Tesla sent a Model 3 through the complex intersection. Footage shows the car expertly navigating the roundabout – not just once, but three times – as cars continuously join from seemingly every direction. Fans have flocked to X to discuss the feat, with one calling it'superb'.


Benchmarking Akan ASR Models Across Domain-Specific Datasets: A Comparative Evaluation of Performance, Scalability, and Adaptability

Mensah, Mark Atta, Wiafe, Isaac, Ekpezu, Akon, Appati, Justice Kwame, Abdulai, Jamal-Deen, Wiafe-Akenten, Akosua Nyarkoa, Yeboah, Frank Ernest, Odame, Gifty

arXiv.org Artificial Intelligence

Most existing automatic speech recognition (ASR) research evaluate models using in-domain datasets. However, they seldom evaluate how they generalize across diverse speech contexts. This study addresses this gap by benchmarking seven Akan ASR models built on transformer architectures, such as Whisper and Wav2Vec2, using four Akan speech corpora to determine their performance. These datasets encompass various domains, including culturally relevant image descriptions, informal conversations, biblical scripture readings, and spontaneous financial dialogues. A comparison of the word error rate and character error rate highlighted domain dependency, with models performing optimally only within their training domains while showing marked accuracy degradation in mismatched scenarios. This study also identified distinct error behaviors between the Whisper and Wav2Vec2 architectures. Whereas fine-tuned Whisper Akan models led to more fluent but potentially misleading transcription errors, Wav2Vec2 produced more obvious yet less interpretable outputs when encountering unfamiliar inputs. This trade-off between readability and transparency in ASR errors should be considered when selecting architectures for low-resource language (LRL) applications. These findings highlight the need for targeted domain adaptation techniques, adaptive routing strategies, and multilingual training frameworks for Akan and other LRLs.


Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning

Wang, Liying, D., Ph., Carrington, Daffodil, S., M., Filienko, Daniil, S., M., Jazmi, Caroline El, S., M., Xie, Serena Jinchen, S., M., De Cock, Martine, D., Ph., Iribarren, Sarah, D., Ph., Yuwen, Weichao, D, Ph.

arXiv.org Artificial Intelligence

Family caregivers often face substantial mental health challenges due to their multifaceted roles and limited resources. This study explored the potential of a large language model (LLM)-powered conversational agent to deliver evidence-based mental health support for caregivers, specifically Problem-Solving Therapy (PST) integrated with Motivational Interviewing (MI) and Behavioral Chain Analysis (BCA). A within-subject experiment was conducted with 28 caregivers interacting with four LLM configurations to evaluate empathy and therapeutic alliance. The best-performing models incorporated Few-Shot and Retrieval-Augmented Generation (RAG) prompting techniques, alongside clinician-curated examples. The models showed improved contextual understanding and personalized support, as reflected by qualitative responses and quantitative ratings on perceived empathy and therapeutic alliances. Participants valued the model's ability to validate emotions, explore unexpressed feelings, and provide actionable strategies. However, balancing thorough assessment with efficient advice delivery remains a challenge. This work highlights the potential of LLMs in delivering empathetic and tailored support for family caregivers.


How high is `high'? Rethinking the roles of dimensionality in topological data analysis and manifold learning

Sansford, Hannah, Whiteley, Nick, Rubin-Delanchy, Patrick

arXiv.org Machine Learning

We present a generalised Hanson-Wright inequality and use it to establish new statistical insights into the geometry of data point-clouds. In the setting of a general random function model of data, we clarify the roles played by three notions of dimensionality: ambient intrinsic dimension $p_{\mathrm{int}}$, which measures total variability across orthogonal feature directions; correlation rank, which measures functional complexity across samples; and latent intrinsic dimension, which is the dimension of manifold structure hidden in data. Our analysis shows that in order for persistence diagrams to reveal latent homology and for manifold structure to emerge it is sufficient that $p_{\mathrm{int}}\gg \log n$, where $n$ is the sample size. Informed by these theoretical perspectives, we revisit the ground-breaking neuroscience discovery of toroidal structure in grid-cell activity made by Gardner et al. (Nature, 2022): our findings reveal, for the first time, evidence that this structure is in fact isometric to physical space, meaning that grid cell activity conveys a geometrically faithful representation of the real world.